Learning by Exploration Including Risk

نویسندگان

  • Sylvia Wiebrock
  • Fritz Wysotzki
چکیده

In this paper we describe how methods of concept (classi cation) learning can be used to solve constraint satisfaction problems in the spatial domain, for example if a set of numerical constraints de nes a complicated region in the physical or feature (parameter) space. In this case it is in general not easy or even not possible to nd explicit boundary descriptions by some kind of formula manipulation system. Formally, the task is learning a decision function A(x1; : : : ; xn) which decides whether a vector x = (x1; : : : ; xn) belongs to a region where the predicate A is true, i.e. the corresponding constraints are satis ed. In the examples given below, the constraints consist of equations and inequations containing trigonometric functions which lead to computational di culties well known in robotics (cf. Ambler and Popplestone (1975) and Taylor (1976)). Seen from the cognitive point of view, our claim is that at least parts of human knowledge (facts and general rules) are learned by experience and directed exploration using the type of inductive inference introduced in this paper. Learning rules (logical implications) of the form 8x(A(x1; : : : ; xn) ! B(x1; : : : ; xn)) is the more general case which can be handled by our method, too (see Sect. 2.1). In the following we identify a region where a constraint A is satis ed with a class (concept) A. Now algorithms of classi cation learning (e.g., decision tree learning or Neuronal Nets) by means of a training set construct classi ers which | by inductive generalization | decide the class membership of an arbitrarily chosen x (not necessarily contained in the training set). Usually, there is a generalization error due to the unavoidable approximation of the boundaries between the A-region and the not A-region. The generalization error can be measured using a test set of classi ed example vectors di erent from the training set. Decision tree learners approximate the class boundaries piecewise linearly by axis-parallel hyperplanes, generalized Perceptrons by hyperplanes in general position. In the case of rule learning, the implication 8x(A(x) ! B(x)) can be \decided" within the limits of approximation error by testing whether every x taken from the test set and classi ed as belonging to the A-region also belongs to the B-region. This procedure will be demonstrated by means of an example problem taken from the area of spatial reasoning using the decision tree learner CAL5 (M uller and Wysotzki 1994) and the neuronal net DIPOL (Schulmeister and Wysotzki 1997) to learn the semantics of spatial relations like right or

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Separating Well Log Data to Train Support Vector Machines for Lithology Prediction in a Heterogeneous Carbonate Reservoir

The prediction of lithology is necessary in all areas of petroleum engineering. This means that to design a project in any branch of petroleum engineering, the lithology must be well known. Support vector machines (SVM’s) use an analytical approach to classification based on statistical learning theory, the principles of structural risk minimization, and empirical risk minimization. In this res...

متن کامل

Exploration of Arak Medical Students’ Experiences on Effective Factors in Active Learning: A Qualitative Research

Introduction:: Medical students should use active learning to improve their daily duties and medical services. The goal of this study is exploring medical students’ experiences on effective factors in active learning. Methods: This qualitative study was conducted through content Analysis method in Arak University of Medical Sciences. Data were collected via interviews. The study started with p...

متن کامل

Safety, Risk Awareness and Exploration in Reinforcement Learning

Safety, Risk Awareness and Exploration in Reinforcement Learning by Teodor Mihai Moldovan Doctor of Philosophy in Computer Science University of California, Berkeley Professor Pieter Abbeel, Chair Replicating the human ability to solve complex planning problems based on minimal prior knowledge has been extensively studied in the field of reinforcement learning. Algorithms for discrete or approx...

متن کامل

How Agile is Agile Enough? Towards A Theory of Agility in Software Development

One poorly investigated issue in organizational agility is the question how organizations change their speed while adopting and exploiting new IT capability. In this paper we outline a theory of software development agility that draws upon a model of IT innovations by Swanson and on March’s learning theory and in particular on his concepts of exploration and exploitation. We explore how both ex...

متن کامل

Probabilistic Exploration in Planning while Learning

In decision-theoretic planning Pemberton and Korf (1994) have proposed separate heuristic functions for exploration and decision-making in incremental real-time search algorithms. Draper et al. (1994) have developed a probabilistic planning algorithm that performs both information-producing actions and contingent planning actions. Our exploration strategy could be applied to these planning task...

متن کامل

Optimizing the Grade Classification Model of Mineralized Zones Using a Learning Method Based on Harmony Search Algorithm

The classification of mineralized areas into different groups based on mineral grade and prospectivity is a practical problem in the area of optimal risk, time, and cost management of exploration projects. The purpose of this paper was to present a new approach for optimizing the grade classification model of an orebody. That is to say, through hybridizing machine learning with a metaheuristic ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999